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Abstract — Ternary channels can be used to model the behavior 
of some memory devices, where information is stored in three 
different levels. In this paper, error correcting coding for a 
ternary channel where some of the error transitions are not 
allowed, is considered. The resulting channel is non-symmetric, 
therefore classical linear codes are not optimal for this channel. 
We define the maximum-likelihood (ML) decoding rule for 
ternary codes over this channel and show that it is complex 
to compute, since it depends on the channel error probability. 
A simpler alternative decoding rule which depends only on code 
properties, called g(a -decoding, is then proposed. It is shown that 
dA -decoding and ML decoding are equivalent, i.e., dA -decoding 
is optimal, under certain conditions. Assuming c?a -decoding, we 
characterize the error correcting capabilities of ternary codes 
over the non-symmetric ternary channel. We also derive an upper 
bound and a constructive lower bound on the size of codes, given 
the code length and the minimum distance. The results arising 
from the constructive lower bound are then compared, for short 
sizes, to optimal codes (in terms of code size) found by a clique- 
based search. It is shown that the proposed construction method 
gives good codes, and that in some cases the codes are optimal. 



I. Introduction 

Error correcting coding plays a central role in any com- 
munication system. Since the seminal paper by Shannon, the 
main body of research on coding theory has been devoted 
to binary linear codes. However, non-binary codes have also 
demonstrated remarkable performance. Among them, Reed- 
Solomon codes [1] are one of the most popular and widely 
used coding schemes. Recently, the interest for non-binary 
codes has been renewed with the rediscovery of low-density 
parity-check (LDPC) codes [2]. Non-binary LDPC codes 
have been shown to perform very close to capacity and to 
outperform binary LDPC codes in some cases [3]. 

Most of the previous works on non-binary codes consider 
a Galois Field whose order q is a power of 2. On the other 
hand, little attention has been devoted to non-binary codes 
when q is not a power of two. Indeed, for conventional 
channels, binary linear codes or g-ary codes with q being a 
power of two show very good performance. Ternary codes for 
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Fig. 1. Non-symmetric ternary channel. 



the symmetric ternary channel using the ordinary Hamming 
distance metric have been considered in the literature before. 
See, for instance, [4, 5] and references therein. In this paper, 
however, we consider error correcting coding for a ternary 
non-conventional channel. 

Recently, coding for flash memories has received some 
attention. See, for instance, [6-8] and references therein. 
Multilevel flash memory is a storage technology where the 
charge level of any cell can be easily increased, but not easily 
decreased. In fact, the only way to decrease the charge level 
of a cell is to erase the whole block (i.e., set the charge on 
all cells in a block to zero) and reprogram each cell. This is a 
time-consuming process which consumes energy and reduces 
the lifetime of the memory. The coding problem for flash 
memories is to design modulation codes that maximize the 
number of rewrites between two erasures. 

In this paper, however, we look at a different memory 
device coding problem, namely coding for electrically erasable 
programmable read-only memories (EEPROMs), which are 
semiconductor memories that retain their data contents when 
power is off. They can be read and written to like standard 
RAMs and are suitable for applications where storage of small 
amounts of data is critical and periodic writing of new data 
is required. Typical applications are radio frequency identifi- 
cation tag, smart dust, or automotive applications including 
car audio and multimedia, chassis and safety, and power 
train. The communication channel underlying EEPROMs can 
be suitably modeled as a binary symmetric channel (BSC). 
Currently, very simple error correcting codes based on the 
well-known Hamming codes combined with hard decoding 
are implemented on-chip to correct single bit errors [9]. 
However, next generation devices demand for more stringent 
requirements in terms of reliability as well as storage density. 
A suitable modification of the physics of EEPROM memories 
allows to store the information in three levels, thus higher 
densities can be achieved. While transitions between adjacent 
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Section II, we outline some of the notation used in this 
paper. Then, in Section III, we address the computation of 
the channel capacity. The ML decoding rule for ternary codes 
over the non-symmetric channel is given in Section IV. Also, a 
simpler decoding rule which depends only on code properties 
is derived. Section V addresses the error correcting capabilities 
of ternary codes over the non-symmetric ternary channel. In 
Section VI, an upper bound on the size of codes is given, 
and in Section VII a constructive lower bound is derived. An 
encoding algorithm is given in Section VIII. The construction 
method is generalized to q-ary non-symmetric channels in 
Section IX, and in Section X we compare the values from the 
constructive lower bound with the results of a clique-based 
search. Finally, in Section XI, we draw some conclusions. 



Fig. 2. Ratio of symbols in the optimal input distribution p(x) that 
maximizes I(X, Y) of channel Wasa function of p. 



levels are allowed, transitions from the highest level to the 
lowest level and from the lowest level to the highest level 
are physically not possible. A simple model for the resulting 
channel is the discrete memoryless ternary channel with input 
alphabet X = {0,1,2}, output alphabet y = {0,1,2}, and 
probability transition matrix 



l-p p/2 p/2 
P(y\x) = | P/2 1 -p/2 

p/2 l-p/2 



(1) 



where p < 2/3 and the entry in the ith row and the jth column 
denotes the conditional probability of receiving symbol j when 
symbol i was transmitted. Notice that transitions 1 — > 2 and 
2 — > 1 are not allowed. As a result, the channel defined by (1) 
is non-symmetric. The channel model is depicted in Fig. 1. 

In this paper, we consider error correcting coding for the 
non-symmetric ternary channel of Fig. 1. We define the 
maximum-likelihood (ML) decoding rule for ternary codes 
over this channel and show that its implementation is complex, 
since it depends on the channel error probability p. As an 
alternative, a simpler decoding rule which depends only on 
code properties, called dA-decoding, is proposed based on 
a more appropriate distance measure. It is shown that under 
certain conditions the proposed decoding rule is optimal, i.e., it 
is equivalent to ML decoding. We then address error correcting 
capabilities of ternary codes under ^A-decoding. In particular, 
we derive a sphere-packing bound to upper bound the size of 
the codes assuming rfA-decoding. We also derive a constructive 
lower bound on the size of the codes given the code length 
and its minimum distance, which proves the existence of good 
codes. The construction method is based on binary block codes 
as basic elements. The construction method is then generalized 
to a non-symmetric q-aiy channel. Finally, for ternary codes 
of small sizes, we compare the constructive lower bounds to 
optimal codes (in terms of code size) found by a clique-based 
code search. It is shown that the binary code construction 
method gives very good codes. Also, in some cases, optimal 
ternary codes are obtained. 

The remainder of the paper is organized as follows. In 



II. Notation 

Throughout the paper, we use capital letters to denote 
random variables, e.g., X, and its calligraphic version, X, to 
denote the alphabet of X. Also, for convenience, we denote 
the probability mass function by p(x) = Yi(X = x),x £ X, 
rather than by px (x). We will write vectors in boldface letters, 
and the ith element of a vector a as a^. The cardinality of a 
set S, i.e., the number of elements in S is denoted by |<S|. 
Furthermore, a q-ary code C of length n is a subset of Z™, 
q > 2, where Z g = {0, . . . , q — 1} and Z™ is the set of all n- 
tuples over Z g . We will use subindexes to distinguish between 
codes over alphabets of different order q. For instance, a binary 
code will be denoted by C2 and a ternary code by C3. A code 
C with code length n containing M = \C\ codewords and 
minimum distance d shall be referred to as an [n, M, d] code. 
The Hamming weight of a vector a is denoted by w a and the 
Hamming distance between two vectors a and b is denoted by 
dn(a, b). For simplicity, we shall denote the non-symmetric 
ternary channel of Fig. 1 by H. 

III. Channel Capacity 

In this Section, we derive the capacity (C) for the channel 
model of Fig. 1, defined as 



C = m&xI(X,Y) 

p(x) 



(2) 



where I(X, Y) is the mutual information between X and Y . 
We denote by p x , x £ {0, 1, 2}, the probability Pr(A" = x). 
Due to the symmetry between symbols 1 and 2, we can 
assume that the input distribution p(x) = (po,pi,P2) which 
maximizes I(X,Y) will be such that pi = p 2 . Thus, the 
distributions we are interested in are entirely characterized 
by p and take the form p(x) = (p , ^p-)- With this 
constraint, I(X, Y) can be written as 



I(X,Y) = h[ Po ^ 
+ (1 



p 
2 

Po)(l 



PoP 



-p h(p) - (l-po)h (|) 



log 3 (2) 



(3) 



where h(t) = —tlog 3 (t) - (1 - t)log 3 (l - t) is the ternary 
entropy function. 
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Fig. 3. Channel capacity of the ternary channel of Fig. 1 and of the symmetric 
ternary channel as a function of the error probability p. 



Let / be the function / = dI ^^\ the partial derivative of 
I(X, Y) with respect to po- The zeros of the function / are 
the values of po that maximize I(X, Y). We denote the values 
of po which maximize I(X, Y) by p^. p$ can be written as 



* _ 1+3MP) ~ 2 
P0 ~^IP- 



(4) 



where 



1 2 p 

The values of p^ are given in Fig. 3 as a function of p. 
Since H is not symmetric, the input distribution p(x) that 
maximizes the mutual information I(X, Y) is not uniform. 
For very low values of p, the best input distribution tends to 
the uniform distribution. However, for increasing values of p 
the optimal distribution tends to favor the symbols 1 and 2, 
and symbol should be less used. There is a point after which 
the best distribution is p(x) = (0, 1/2, 1/2} for some range of 
values of p. This implies that symbol should not be used 
for this range of transition error probabilities. In that case, the 
optimal codes are binary codes on symbols {1,2}. For values 
of p approaching one, the best distribution tends again to the 
uniform distribution. 

The channel capacity is depicted in Fig. 3 as a function 
of p. For comparison purposes, the capacity of the symmetric 
ternary channel is also given. Clearly, the capacity of the non- 
symmetric ternary channel is higher. 

IV. ML Decoding and c?a-decoding 

In this Section, we give the ML decoding rule for the 
ternary channel H of Fig. 1. We then propose an alternative 
decoding rule, called c?a -decoding, which is much simpler to 
compute, and show that both rules are equivalent under certain 
conditions. 

For later use, let C 3 C Z3 be a ternary code of length n. 
Also, let x = (xi, . . . ,x n ) be a codeword in C 3 which is 
transmitted over channel H, and y = (y 1 , . . . , y n ) € Z3 the 



received noisy observation at the output of the channel. The 
user data is assumed to be uniformly distributed, and thus also 
the codewords. 

Let u, v e Z3 be two ternary vectors, and define the subsets 
So, Si, S2, and £3 as 

50 = {i : Ui = Vi = 0} 

51 = {i : Ui = Vi 7^ 0} 

5 2 = {i : Ui ^ Vi A UiVi = 0} 

5 3 = {i : Ui ^ Vi A ^ 0} . 

We define the following distance measure between two ternary 
vectors u and v transmitted over channel Tt: 

Definition 1: Let u = (u\, . . . , u n ) and v = (vi, . . . , v n ) 
be two vectors in Z 3 l transmitted over channel H with error 
probability p. The distance e?ML(u,v) between u and v is 
defined as 



(6) 



g?ml(u, v) 



00 

-|5b|log(l-p) 

-|5i|log(l-p/2) 

-\S 2 \log(p/2) 



if 1 5.3 1 > 



otherwise 



(7) 



Remark 1: Notice that with some abuse of language, we 
call duL a distance measure. However, formally speaking c2ml 
is not a distance measure, since the identity of indiscernibles 
does not hold. Also, note that for symmetric channels the 
distinction between subsets So and <Si is not necessary, since 
the conditional probabilities p(y\x) are independent of x. 
Similarly, the distinction between subsets £2 and 1S3 is not 
required for symmetric channels. 

We can express the ML decoding rule 



x = argmaxp(y|x) 

xGC 3 



(8) 



in terms of the distance c?ml(x, y). By taking the logarithm 
of the conditional probability p(y|x) we obtain: 

n 

- log(p(y|x)) = ^2 ~ lo g(p(j/»l^)) = <4iL(x,y) (9) 

i=l 

where the first equality is due to the assumption that the 
channel is memoryless. Using (9), the ML decoding rule can 
then be formulated as follows: 

Given a received word y, decode to the codeword x 
that minimizes the distance c?ml (x, y). 

Proof: It is sufficient to prove that for a given y, when x 
varies among codewords, 4il (x, y) increases for decreasing 
values of p(y|x). For c?ml(x, y) = 00 there is at least one 
position i such that the transition Xi — ► yi is not permitted; 
therefore p(y|x) = 0. Now, we consider the case where 
^ml(x, y) < 00, in which case 



p(y|x) = exp(-d M L(x,y)) . 



(10) 



Then, the result follows from the monotonicity of the expo- 
nential function. ■ 
Notice that c? ML depends on both the code and the channel 
transition probability p. However, one would be interested in 
a distance metric that depends only on the code, thus allowing 
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Fig. 4. Maximum value of p, p m ax, for the equivalence between 
decoding and ML decoding rules as a function of code length n. 



for a simpler decoding rule. We define the following distance 
measure between two ternary vectors u and v: 

Definition 2: Let u and v be two vectors in Z3. The 
distance d A (u, v) between u and v is defined as 



d A (u,v) = y^d A (ui,Vi) 



(11) 



where 



d A (ui,Vi) 



if u t = Vi, 

1 if u t ^ Vi A UiVi = 0, (12) 
00 if Ui 7^ Vi A u-iVi 7^ 0. 

Remark 2: Notice that in this case the identity of indis- 
cernibles holds. However, the triangular inequality does not 
hold anymore. 

Using the distance d A (u,w) we can define the following 
decoding rule which does not depend on p: 

Given a received word y, decode to the codeword x 
that minimizes the distance d A (x, y). 

In the remainder of the paper we shall refer to this decoding 
rule as d A -decoding. We denote by t A the error correcting 
capability of a code C 3 over the channel H under dA-decoding. 
Note that dA-decoding does not necessarily minimize the 
probability of error. However, we can prove the following 
Theorem: 

Theorem 1: Let C3 be a ternary code of length n, and 
let H be the ternary channel of Fig. 1 with transition error 
probability p. dA-decoding and ML decoding of codewords 
transmitted over TL are equivalent for all codes C3 of length n 
if and only if the following inequality is satisfied: 

\ - — ! • (13) 



p/2 



vIV 



i-p ~ \i 

Proof: See Appendix A. ■ 
Theorem 1 gives the range of values of the channel error 
probability p such that d A -decoding is equivalent to ML 
decoding. We denote the maximum value of p such that d A - 
decoding and ML decoding are equivalent by p max . The value 



Pmax is depicted in Fig. 4 as a function of the code length n. 
dA-decoding and ML decoding are equivalent for all values of 
p under the curve. For small values of n, dA-decoding and ML 
decoding are equivalent even for high values of p. The range of 
p for which dA-decoding is optimal decreases with the block 
length n. For instance, for n ~ 100, dA-decoding is optimal 
for p < 0.1. This is by far compatible with requirements of 
memory devices. Therefore, for practical purposes, the dA- 
decoding rule is optimal and can be considered instead of the 
more complex ML decoding rule with no loss in performance. 

V. Error Correcting Capabilities 

In the following, we analyze the distance properties and the 
error correcting capabilities of ternary codes over the ternary 
channel H under the dA -decoding rule defined in the previous 
Section. We require the definition of another distance measure: 

Definition 3: Let u and v be two vectors in Z 3 \ The 
distance d-^iu, v) between u and v is defined as 

d B (u,v)= min (d A (u,w) + d A (w, v)) . (14) 

It is easy to check that de is such that for two ternary 
symbols u i} Vi e Z 3 the following equalities are satisfied: 

{0 if Ui — Vi, 
1 if Ui ^ Vi A u^i = 0, (15) 
2 if Ui ^ Vi A u t Vi ^ 0. 

Remark 3: Notice that for two binary vectors u and v, 
d B (u, v) = d H (u, v). 

We define the minimum de -distance of a ternary code, 
denoted by dB, m in, as follows: 

Definition 4: Let x and x be two distinct codewords of C3. 
The minimum de -distance of code C3 is 



c?B,min = min d B (x,x). 
x,xec 3 

x^x 



(16) 



Then, assuming dA-decoding, the error correcting capability 
£a of a ternary code over the channel H is given by the 
following Proposition: 

Proposition 1: The error correcting capability t A of a code 
C3 over the ternary channel TL is 

^B,min — 1 



Proof: See Appendix B. 



(17) 



VI. A Sphere-packing Bound 

The main goal when designing codes is that of achieving the 
largest possible minimum distance with the highest possible 
code rate. In this Section, we give a simple upper bound on 
the size of codes over the ternary channel TL assuming dA- 
decoding. In particular, we derive a sphere-packing bound. 
However, its formulation is harder than for the case of sym- 
metric channels. Since transitions 1^2 and 2 — > 1 are not 
possible, the ternary space we deal with is not isotropic and 
has the shape of a hypercube of dimension n centered on 
the all-zero vector (see Fig. 5 for n — 3). Therefore, spheres 
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Fig. 5. Z| with distance cZb- 



have smaller volumes if they are closer to the vertices of the 
hypercube. The goal here is to find how many spheres of a 
given radius can be packed in the ternary space. Denote by 



<S(u, 



{v£ZJ : d B (u,v) <r} 



(18) 



C'2 



- e 2 
2e 2 



(19) 



the sphere with center u and radius r in Z3, and its volume 
by |«S(u, r)\. The following Proposition gives a lower bound 
on the value of |5(u, r)\: 

Proposition 2: Let <S(u, r) be a sphere with center u and 
radius r in Z3 of volume |«S(u, r)\. It follows that 

r \d/2\ 

|5(u,r)| > EE 

d=0 e 2 =0 

where the bound is attained by spheres centered on the vertices 
of the hypercube. 

Proof: See Appendix C. ■ 

It is now possible to formulate the sphere-packing bound 
for our channel. 

Theorem 2: Let C3 be a ternary code of length n and 
minimum de-distance d B , m in over the ternary channel H. It 
follows that 

\Cs\ < . — • (20) 



tA Ld/2J 

EE 

d=0 e 2 =0 



e 2 



n - e 2 
d- 2e 2 



Proof: See Appendix D. ■ 
Note that the tightness of the upper bound in (20) worsens 
with increasing values of de, m in, since the tightness of the 
lower bound on the volume of the spheres given by Proposi- 
tion 2 also decays when de, m in increases. 

VII. Constructive Lower Bound 

In this Section, we give a constructive lower bound on the 
size of codes over channel H and show the existence of good 
codes. Given the code length n and the minimum de-distance 
dB,min, the goal is to construct an [n, M, dB.min] code C3 for 
the ternary channel H with reasonable M and error correcting 
capability tx given by (17). The proposed construction method 
uses binary codes as basic elements. In particular, we define 



mappings that are applied to binary codes to generate a set of 
codewords of Z 3 l that satisfies a given minimum de -distance 
dB,min- For clarity purposes, we first summarize the proposed 
construction method, and then formalize it. 



A. Sketch of the Construction Method 

The proposed construction method is a two-step procedure. 
First, we build a large amount of subspaces of Z3 such that 
the de-distance between any two subspaces is at least de^in 1 - 
To this aim, we consider an [n, M 2 , de, m in] binary code C 2 for 
the BSC, such that each codeword x G C 2 defines a subspace 
£ x of the ternary space. £ x is the set of ternary words yielding 
x when projected to binary words (by changing their symbols 
2 into symbol 1). Notice that the cardinality of subspaces £ x 
is |£ x | = 2"*. 

Example 1: Let x = 1100. Then, 

£x = {1100,1200,2100,2200} 

and |£ x | = 2 2 = 4. 

The use of codes for the BSC comes from the fact that the 
binary projection of the transmission chain (x G Z 3 l ) — > H — > 

(y G Z 3 l ) is (x G Z 2 l ) -> H -» (y e If), where H is the 
BSC. 

The second step of the code construction is then to select 
words within every subspace £ x that are distant from each 
other by at least de, m in- To this end, we consider £ x as a 
binary space Z^* and use a [w x , M^ *, dn.min] code C^* for 
the binary erasure channel (BEC), with minimum Hamming 
distance d H , m in > dB 2 """ ■ 

Example 2: Let x = 1100. Then, £ x is mapped to Z 2 by: 



1100 
2100 



00, 
10, 



1200 
2200 



01, 
11. 



Now, if we choose C| = {00,11}, then the selected ternary 
codewords in £ x are 1100 and 2200. 

Notice that if x G £ x is transmitted, the received vector y 
might not belong to £ x . If the receiver is able to determine 
that x G £ x , i.e., x = x, we know the position of the zeros 
of x. Therefore, only errors in the remaining positions (in the 
form 1 — > and 2^0) must be considered. These transitions 
correspond to erasures in a BEC, hence C^* must be a good 
code for the BEC able to correct at least t& errors. Notice that 
for a BEC this corresponds to dn.n 
Example 3: Let x = 2200 (x = 



in > tA + 1 = ^f" 

1100). Assume that y 
0200 was received and that the receiver is able to correctly 
estimate x = 1100. If Cf = {00, 11} was chosen, then 0200 
is mapped to ?1 where we use symbol ? to denote an erasure. 
Then, the decoder of C\ = {00, 11} will decode ?1 to 11, 
which corresponds to the ternary codeword 2200 in £1100 • 

The set of ternary vectors selected within the subspaces £ x 
using codes C™* forms the [n, M, de, m i n ] ternary code C 3 . 

'Here, the distance between two non-empty subsets Si and S2 of a metric 
space is defined as the minimum distance between any two elements si 6 Si 
and s 2 £ S2. 
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B. Mappings and Their Topological Properties 

Let u be a vector of Z 2 and denote by w u its Hamming 
weight. We denote by g u (j) (1 < j < w u ) the jth non-zero 
entry of u. We define the mapping ip u such that: 



in ■ 



(21) 



a i — > J2 a i e sUj) 

3=1 

where (ei)i<i<„ is the canonical basis of Z3. We also call £ u 
the subspace of Z3 defined by £ u = ip u (Zg""). 

Example 4: For u = 10011000, we have g u (l) = 1, 
g u (2) = 4, fif u (3) = 5, and ^ u (201) = 20001000. In general, 
the elements of £ u are the vectors of the form aOO&cOOO for 
a, 6, c G Z 3 . 

We define another mapping ijj that transforms a word in 
(Z 2 U {?})" into a ternary word by mapping — > 1, 1 — > 2, 
and ? -> 0: 

V: (Z 2 U{?})« - 
b h 

i=l 

where 

V>(?) = 0, -0(0) = 1, and V(l) = 2. (23) 

Eram/?/e 5: For 6 = 1101071?, we have -0(1101071?) = 
22121020. 

The mappings (21) and (22) have several topological prop- 
erties regarding d B : 

Proposition 3: Let u and v be two vectors in Z 2 , and ip 
the mapping defined in (22). It follows that 



7LI 



(22) 



d B (V(u),V(v))=2d B (u,v). 



(24) 



Proof: Since d B (l,2) = 2d B (0, 1) and d B (a,a) = for 
all a G Z3, we have 



d B (^(u), V(v)) = MM*))i, (V(v))i) 

n 

= ^d B (wi + + 1) 

i=l 
n 

= ^2d B (ui,Wi) = 2d B (u,v). 



(25) 



1=1 



Proposition 4: Let u G Z 2 . For u, u' G Z3 u , the following 
equality holds: 



d B (<£ u (u),</?u(u')) = d B (u,u')- 



(26) 



Proof: 



d B ((p u (u),(p u (u')) =d B X"'''- i -X!"/'- 



= y^d B (Mj,Mj) = d B (u, u'). 



(27) 



Proposition 5: Let u, v G Z 2 , and let u G Z^" and v G 
Zg° v , both with no zero entries. The following inequality holds: 

d B (<£ u (u), Vv(v)) > d B (u,v). (28) 

Proof: Since u and v have no zero entries: 

<fe(Vu(u), Vv(v)) > d B (yu(l«, u ),^v(l«, v )) = d B (u,v) 

(29) 

where l x denotes the all-one vector of length x. ■ 

C. Construction and Lower Bound 

Let C 2 be an [n, M 2 , de.min] binary code with minimum 
Hamming distance df^min = dB,min and denote by A d its 
weight enumerator (WE), the number of codewords of weight 
d (0 < d < n). For all values of d such that A d ^ 0, let C| 



be a [d, M 2 , dn, m in] binary code with dn 
consider the following ternary code: 

C 3 = (J ¥> : 

xec 2 



> 



(V(CD). 

Proposition 6: The cardinality of code C3 satisfies: 



|C 3 



d=0 



A d \C a 2 



. We 



(30) 



(31) 



Proof: Since for all x G C 2 , (fx and V are trivially 
injective, it is enough to prove that the union tp^ (^(C™* )) 

xGC 2 

is disjoint. 

For x, z G C 2 such that x / z, let x 6 ^(^(C^*)) and 
z G ^(^(C^)). By Proposition 5, d B (x, z) > d B (x, z) > 0, 
and thus x/z. ■ 

Corollary 1: 



\c 3 \ = Y, AdM *- 

d=0 



(32) 



Proposition 7: Let x and z be two distinct codewords of 
C 3 . Then d B (x, z) > d B , min . 

Proof: See Appendix E. ■ 

Therefore, we have constructed an [n, M, dB,min] ternary 
code C3, where M = J2d=o AdM 2 (see Proposition 6 above), 
starting from the binary codes C 2 and {C|} (0 < d < n). 

Example 6: We construct a [5,21,3] code C3 for the chan- 
nel H. First, we consider the binary code C 2 with parameters 
[5,4,3] defined by 



C 2 = {00100, 11000, 00011, 11111}. 



(33) 



Its weight enumerator has three non-zero values: A\ = 1, 
A 2 = 2, and A 5 = 1. Therefore, we require three binary 
codes C 2 , Cf , and Cf of minimum Hamming distance at least 
[§] = 2. We choose C\ = {0}, Cf = {00,11}, and Cf the 
code with generator matrix 



/0 



V 1 



1\ 




0/ 



The code C3 is obtained by applying (30). The construction 
of C3 is represented in Fig. 6. For each codeword x G C 2 
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c 2 


00100 


11000 


00011 


11111 













c 1 



0h^ 00100 



00 1 > 1 1000 
22000 



r 2 



00 H 


+00011 


11 H 


+ 00022 



00000 




11111 

11111 


oooi i 




1 1 1 99 

1 1 .1- __ 


001 1 




1 1 991 

11ZZ1 


001 01 




1 1 91 9 


01 1 00 




1 991 1 

1ZZ11 


01 1 1 1 

Ul 111 




1 9999 


01 01 

U1U1U 




1 91 91 


01001 


1— > 


12112 


11000 


1— > 


22111 


11011 


1— > 


22122 


11110 


1 — > 


22221 


11101 


1 — > 


22212 


10100 


1 — > 


21211 


10111 


1 — > 


21222 


10010 


1— > 


21121 


10001 

■ 


1— > 


21112 



Fig. 6. Example of the construction of a [5,21,3] ternary code C3 for 
channel H using binary codes as basic elements. The arrows '1— >' represent 
the application of the mappings (21) and (22) to the codewords of C2 and to 
the codewords of the codes C™ x , respectively. 



TABLE II 

Size M of ternary codes obtained using binary codes from [12] 

AS BASIC ELEMENTS. FOR EACH LENGTH n AND SIZE M 2 OF THE CODES 
C'2, WE REPORT THE SIZE M" max (RESP. M min ) OF THE LARGEST (RESP. 
SMALLEST) TERNARY CODE OBTAINED, AND THE AVERAGE SIZE M. THE 
NUMBERS BETWEEN BRACKETS ARE THE NUMBER OF BINARY CODES OF 
SIZE M2. 



71 


Mo 


1VA max 


J "min 


M 




9 (382) 


120 


95 


106 




10 (174) 


124 


99 


112 




11 (54) 


129 


107 


118 




12 (28) 


133 


115 


124 


7 


13 (8) 


137 


125 


133 




14 (4) 


141 


133 


137 




15 (2) 


145 


141 


143 




16 (1) 


149 


149 


149 




18 (35094) 


307 


246 


278 


8 


19 (431) 


311 


263 


290 




20 (10) 


309 


293 


301 




36 (38996) 


835 


677 


776 




37 (1464) 


833 


745 


792 


9 


38 (116) 


837 


111 


806 




39 (6) 


833 


809 


817 




40 (2) 


825 


825 


825 


10 


72 (1124) 


2298 


2088 


2204 


11 


144(13088) 


6653 


6195 


6586 



we obtain \C$*\ codewords of C3 through (^(^(C™*)). For 
instance, for x = 11000 and Cf = {00, 11} we obtain |C|| = 
2 codewords of C 3 by applying </?nnoo to ip(z), where z 6 
{00, 11} is one of the codewords of C 2 : 

V>(00) = 11, then pnooo(ll) = H000 
■0(H) = 22, then i^iiooo(22) = 22000. 

In total, C 3 has Ai\C\\ + A 2 \C%\ + A 5 |C|| = 21 codewords. 

Example 7: For a comparison with the [5, 21, 3] code from 
Example 6 above, we tabulate here an optimal [5,27,3] code 
C' 3 found by computer search. The code is defined by 

C 3 = {01112, 00200, 00121, 01001, 01022, 02110, 10010, 
02221, 10102, 11111, 11120, 11221, 12020, 12101, 
12202, 12211, 20020, 20012, 11212, 20111, 21100, 
21211, 21202, 22001, 22210, 22102, 22222}. 

(34) 

The constructive method proposed above gives a lower 
bound on the cardinality of ternary codes over H. We used this 
method to construct codes using extended BCH (eBCH) codes 
for C 2 and codes obtained from the tables in [10, 1 1] for {C 2 }. 
Note that the proposed construction method does not require 
full knowledge of the binary codes used as basic elements 
to compute the lower bound; given n and <iB,min, only the 
knowledge of the weight enumerator Ad of C 2 is required. On 
the other hand, for codes {C 2 }, only the knowledge of the size 
M2 is required. The results are shown in Table I. For given n 
and G?B,min, we report in the table the code size M. The upper 
bound on the size of codes over Tt of length n and minimum 
G?B-distance c?B, m in is also given in the table (numbers between 
brackets). 

The constructive lower bound is strongly dependent on the 
binary codes selected as basic elements. Better bounds than 
the ones in Table I can be obtained if we use, e.g., good non- 
linear binary codes instead of eBCH codes. Furthermore, the 



choice of the binary code C 2 , even among codes of the same 
length, minimum distance, and size, can have a great impact 
on the size of the ternary code generated. We observed that the 
overall WE of the code was crucial; maximizing the size of C 2 
may be less important than using a code with a WE adapted to 
the construction. As an example, for short block lengths n we 
report in Table II the best size (M max ), the worst size (M m i„), 
and the average size (M) of the ternary codes obtained by 
using (non-linear) binary codes of a given size and minimum 
distance 3 [12] for C 2 . The number of codes of a given size M 2 
is given between brackets in the table, and the largest tabulated 
value for M 2 for a given n is the maximum possible, i.e., the 
corresponding codes are optimal. For {Cf } we used the codes 
from [10, 11], since for minimum distance 2 they are optimal 
(their size is M 2 d = 2 d ' 1 ). For n = 11, there exist 13088 
codes of optimal size (M 2 = 144). Among them, choosing a 
code with a more suited WE can bring the size of the generated 
ternary code from M m i n = 6195 to M max = 6653. For n = 9, 
the best ternary codes are not obtained from binary codes for 
C 2 of optimal size (M 2 — 40), but from a binary code of 
size M 2 = 38. In fact, starting from an optimal code yields 
worse results than considering codes of smaller sizes, down 
to M 2 = 36. 

Remark 4: With respect to the list of codes in [12] we also 
considered the codes obtained by adding (modulo 2) the all- 
one vector to every codeword of the code, since this changes 
the WE of the code, which may have a great impact on our 
construction. 

VIII. Variable-length to Fixed-length Encoding 

The construction method proposed in the previous Section 
provides codes as sets of codewords. However, finding a 
simple encoding from the set of messages (thought of as 
binary words of fixed length) to the set of codewords is a 
difficult problem. Notice that by construction the resulting 
ternary codes are non-linear. While it is always possible to 
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TABLE I 

Constructive lower bound obtained using eBCH codes as basic elements and upper bound (in brackets) on the size M of ternary 

CODES FOR DIFFERENT VALUES OF THE BLOCK LENGTH n AND MINIMUM rfg -DISTANCE cfB.min ■ 



n 





8 




16 




32 




64 




128 




2 


3281 


(6561) 


2.15E7 


(4.30E7) 


9.26E14 


(1.85E15) 


1.71E30 


(3.43E30) 


5.89E60 


(1.17E61) 


4 


241 


(729) 


675681 


(2.53E6) 


1.44E13 


(5.61E13) 


1.34E28 


(5.28E28) 


2.30E58 


(9.13E58) 


6 






5985 


(281351) 


2.84E10 


(3.30E12) 


6.55E24 


(1.60E27) 


2.81E54 


(1.40E57) 


8 


17 


(41) 


2529 


(45169) 


5.33E8 


(2.84E11) 


5.14E22 


(7.17E25) 


1.09E52 


(3.22E55) 


10 














4.72E19 


(4.23E24) 


1.29E48 


(9.77E53) 


12 










2.74E6 


(4.43E9) 


3.12E18 


(3.09E23) 


5.17E45 


(3.68E52) 


14 














3.13E15 


(2.68E22) 


2.12E42 


(1.65E51) 


16 






33 


(253) 


133057 


(1.37E8) 


1.00E14 


(2.68E21) 


8.22E39 


(8.61E49) 


20 


















3.42E35 


(3.37E47) 


22 














7.30E10 


(5.40E18) 


8.53E31 


(2.47E46) 


24 














6.92E10 


(8.26E17) 


3.20E29 


(1.98E45) 


28 














1.07E9 


(2.44E16) 


1.00E26 


(1.61E43) 


32 










65 


(7817) 


2.68E8 


(9.54E14) 


4.25E23 


(1.74E41) 


44 


















1.84E19 


(8.52E35) 


48 


















1.15E18 


(2.09E34) 


56 


















1.12E15 


(2.03E31) 


64 














129 


(5.85E6) 


8.79E12 


(3.39E28) 


128 


















257 


(2.47E12) 



enumerate the codewords, for large values of M coding and 
decoding become far too complex. 

To circumvent this drawback, we can consider a variable- 
rate encoding alternative. Consider the [n, M, <iB,min] ternary 
code C 3 constructed following the construction method of the 
previous Section starting from the [n, M 2 , dB, m in] binary code 
C 2 and the [d, M 2 , dp^min] binary codes {C 2 } with dn.min > 
dB, 2 "" n . Assume also that efficient encoders and decoders 
are known for these binary codes over the BSC for C 2 and 
over the BEC for the codes {C 2 }. Let us define the message 
m to be transmitted as an infinite sequence of bits. A simple 
way to progressively encode pieces of m by C3 is as follows: 

1) Let Ui denote the prefix of length k bits of m. 

2) Let xi denote the codeword associated to Ui by C 2 , and 
Wxj its Hamming weight. 

3) We consider the [w Rl , M 2 X1 , dH, m in] binary code C 2 X1 
with c?H,min > [dB,min/2]. Let k w ~ denote the infor- 
mation block length of C^ 1 , and let u 2 be the next 
k w - bits of m and x 2 the codeword associated to u 2 

byC^ 1 - 

4) Transmit x = (p Rl (^>(x 2 )) over H and remove the first 
k + k Wjii bits of m. 

5) Go back to step 1. 

This encoder outputs a sequence of ternary words of length 
n that are decoded after transmission over H by a decoder 
that works with the following pattern: 

1) Consider the first received block of n ternary symbols, 

y- 

2) Let yi denote the word obtained by replacing every 
occurrence of symbol 2 by the symbol 1 in y. 

3) Let Ui denote the output of the decoder of C 2 corre- 
sponding to yi (the estimate of Ui), Xi the codeword 
associated to Ui by C 2 , and the weight of xi. 

4) Let y 2 = ip~ x ^I 1 (xi * y)^, where xi *y denotes the 
element-wise product of the two vectors. 

5) Let u 2 denote the output of the decoder C 2 X1 corre- 
sponding to y 2 . 



6) Output the concatenation of Ui and u 2 , and go back to 
step 1 to decode the next block. 

Proposition 8: On every packet sent using an 
[n, M, de.min] code C3, if less than tx = B ' m 2 "'~ 1 
errors occur, the message is correctly decoded. 

Proof: See Appendix F. ■ 

The first drawback of this effective transmission scheme 
is inherent to the variable-length to fixed-length setting. For 
finite messages m, the length of m will not always match the 
required information block length of the code. In this situation, 
some dummy symbols must be appended to the message prior 
to encoding. While this is not especially a hard problem (a 
simple solution is to append at the end of the message the 
symbol 1, and as many symbols as needed to reach the 
required size, which is easy to code and to decode), it suffers 
from an efficiency loss that increases as the average number 
of blocks sent per message decreases. 

The second obvious drawback is that if more than £a errors 
occur on the same block, it is very likely that the decoder of 
C 2 will decode on a codeword of wrong weight, which would 
result in a shift of the rest of the decoded blocks. The risk of 
losing such amount of data is affordable only in applications 
in which any error in the whole message compromises its use, 
such as the binaries of a software. 

IX. Extension to ^-ary Channels 

In this Section, we extend the construction method of 
Section VII to g-ary codes for the g-ary generalization of the 
ternary channel K. 

Definition 5: For q > 3, let TL q be the channel characterized 
by input alphabet X — {0, . . . , q — 1}, output alphabet y = 
{0, . . . , q — 1}, and the set of conditional probabilities p(y\x) 
such that for x E X and y e J: 

!1 — p if x = y = 0, 

l--^r ifx = y^0, 

^ ifz^anda^O, (35) 

otherwise. 
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Let u be a vector of Z£ of Hamming weight w u . We extend 
the mappings defined in Section VII to the new g-ary setting: 



and 



where 



V>: (Z,_!U{?}y 



Z£ 



Z£ 



V>(?) = and ^(i) = i + 1, < i < g - 2. 



(36) 



(37) 



(38) 



We call £« the subspace of Z™ defined by = ip u (Z™»). 
These extended mappings maintain the topological properties 
that they have for the ternary case regarding d B (see Sec- 
tion VII-B). Notice that the definition of the distances d& 
and d B does not need to be changed to consider the distance 
between vectors in Z™. 

Proposition 9: Let u and v be two vectors in Z"_ x . It 
follows that 



d B (^(u),^(v)) = 2d B (u,v). 



(39) 



Proposition 10: Let u G Z£. For u,u' G Z™ u , the follow- 
ing equality holds: 



dB(Vu(u),Vu(u')) = d B (u,u'). 



(40) 



Proposition 11: Let u, v G Z!j, and let u G Z™ u and v G 
Z™ v , both with no zero entries. The following inequality holds: 



d B (^ u (u),^ v (v)) > d B (u,v) 



(41) 



Propositions 9 to 11 can be proved in the same way as 
Propositions 3 to 5 in Section VII-B, respectively. 

The goal is now to construct, with n and minimum d B - 
distance d Bim ; n given, an [n, M, d B , m ; n ] code C q for the q- 
ary channel H q with reasonable M, starting from elementary 
codes for the binary symmetric channel and for the (q — l)-ary 
erasure channel as basic elements. 

Let C 2 be an [n, M 2 , d Bim i n ] binary code with minimum 
Hamming distance dn.min = <fe,min and denote by Ad its 
weight enumerator. For all d such that Ad ^ 0, let C q _ 1 be a 

[d, M^^dn.min] (<?-l)-ary code with d H ,min > rd B , m in/2]. 
We consider the g-ary code C g obtained as follows: 

C, = |J ^ ( W-i)) • ( 42 ) 

xGC 2 

Proposition 12: The cardinality of code C g satisfies: 



E 

d=0 



9-11- 



Corollary 2: 



\C q \=Y J A d M d q _ 1 . 



(43) 



(44) 



Proposition 13: Let x and z be two distinct codewords of 
C g . Then d B (x,z) > d B , m in- 



The adaptation of the proofs of these propositions for the 
ternary case to the g-ary case is straightforward, and details 
are omitted for brevity. 

Remark 5: The variable-length encoding process in Sec- 
tion VIII for the ternary case can be directly adapted to the 
q-ary case (q > 3) when q is of the form 2 e + 1 with I > 1 
by reading symbols of Z g _i as groups of I bits of the binary 
input. If q is of another form, this direct adaptation is still 
possible but involves an efficiency loss (the number of bits 
read is the highest I such that 2 £ + 1 < n, and some symbols 
of Z q ^i are not used). 

X. Ternary Code Search 

In this Section, for small values of n, we compare the 
constructive lower bound on the size of ternary codes in 
Section VII with the results of a computer-based search for 
good ternary codes. We have conducted both an unrestricted 
code search for ternary codes and a search based on the code 
construction from Section VII. As we will show below, in 
both cases, the code search reduces to the problem of finding 
(weighted) cliques in an undirected (weighted) graph, which 
has been solved using state-of-the-art algorithms from graph 
theory. Both exhaustive algorithms (when the code parameters 
n and dB, m in are small) and a greedy approximate algorithm 
(for larger values of the code parameters) have been used. 

A. Unrestricted Ternary Code Search 

Let G = G(V, E, (d B , m in, Wmin, Wmax)) denote an undi- 
rected graph with vertex set V and edge set E, where each 
vertex v(a) represents a ternary vector a G Z3 with Ham- 
ming weight at least w m ; n and at most w max - Furthermore, 
(v(a),v(b)) G E if and only if d B (a, b) > d Bim ; n . Then, an 
[n, M] ternary block code with codewords of Hamming weight 
at least u> m j n and at most w max and minimum d B -distance at 
least d B , m ; n corresponds to a clique (i.e., a subgraph in which 
all pairs of vertices are adjacent) of size M in the graph G, 
and a maximum-size (or optimal) [n, M] ternary block code 
with codewords of Hamming weight at least w m i n and at most 
iD max and minimum d B -distance at least d Bjm i n corresponds 
to a maximum clique in the graph G. Thus, the code search 
problem reduces to finding cliques in an undirected graph. 

B. Restricted Ternary Code Search Based on the Binary Code 
Construction 

Let G = G(V,E,(d Bymin ,w m i n ,w m!ix )) denote an undi- 
rected weighted graph with vertex set V and edge set E, 
where each vertex v(a) represents a binary vector a G Z£ 
with Hamming weight at least w m ; n and at most w max - 
Furthermore, the weight of a vertex v(a) is the size of 
an optimal binary code of length w a , where w a denotes 
the Hamming weight of a, and with minimum Hamming 
distance at least 



Also, (v(a),v(b)) G E if and 
only if d B (a, b) > d B m i n . Then, an [n,M] restricted ternary 
block code with codewords of Hamming weight at least w m i n 
and at most w max and minimum d B -distance at least d B min 
corresponds to a clique of weighted size M in the graph G, 
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and a maximum-size (or optimal) restricted [n, M] ternary 
block code with codewords of Hamming weight at least w m in 
and at most w maK and minimum de-distance at least dB,min 
corresponds to a maximum weighted clique in the graph G. 
Thus, the code search problem reduces to finding (weighted) 
cliques in a graph. 

Finding the weights of the vertices in the graph above is a 
difficult problem, since they correspond to the sizes of optimal 
binary codes. Obviously, we can find these weights by carrying 
out several code searches in a similar fashion as described 
above, where the vertex set of the ith graph, i = 1, . . . ,n, 
corresponds to a set of binary vectors from "L\. Also, in the ith 
graph, (v(a),v(b)) G E if and only if dn(a, b) > dB 2 "" n . 

C. Exhaustive Search for Maximum Cliques 

The problem of finding a maximum clique (or the equivalent 
problem of finding a maximum stable set) in an arbitrary 
undirected graph is one of the most important NP-hard 
problems in discrete mathematics and theoretical computer 
science. There are several efficient algorithms for searching for 
maximum cliques in arbitrary graphs. See, for instance, [13- 
15] and references therein. One standard approach for finding 
a maximum clique is based on the branch-and-bound method 
[16]. Most branch-and-bound algorithms use heuristic coloring 
algorithms to find an upper bound on the size of a maximum 
clique in the bound step. Sophisticated coloring algorithms can 
reduce the search space significantly. For very dense graphs 
(or very sparse graphs for the equivalent problem of finding a 
maximum stable set), the branch-and-bound algorithm in [14] 
is one of the fastest known algorithms, and we have used this 
algorithm in combination with the algorithm from [13], which 
is faster when the graph is not that dense, when searching for 
ternary block codes. 

Finally, we remark that most algorithms for finding a 
maximum clique can be straightforwardly extended to finding 
a maximum weighted clique. 

D. Greedy Search for Maximum Cliques 

There are numerous heuristic or approximate algorithms in 
the literature for searching for maximum (weighted) cliques 
in an arbitrary (weighted) undirected graph. See, for instance, 
[17] and references therein. In this work, we have used a 
very simple greedy algorithm, which is outlined below in 
Algorithm 1. 

The greedy algorithm in Algorithm 1 is a random algorithm, 
since there could be several vertices v in line 8 that have 
maximum degree. Thus, it is beneficial to run the algorithm 
several times. 

In the setting of a weighted graph, priority is given to the 
vertex of maximum degree (as for the unweighted case), and 
if there are several vertices with degree equal to the maximum 
degree, priority is given (in a random fashion) to the vertices 
with the highest weight. 

E. Results From a Code Search 

When n is not very small, the size of the unrestricted graph 
described in Section X-A becomes very large. Thus, to reduce 



Algorithm 1 Greedy Maximum Clique Search 
1: /* Find an approximate maximum clique in an arbitrary 

undirected graph G = G(V, E) */ 
2: Construct the complement G = G(V, E) of G, where an 

edge e e E if and only if e ^ E. 
3: Initialize the set V with the empty set. 
4: while V ^ do 

5: if 3v e V of degree at most 1 then 

6: select a random vertex v e V of degree at most 1 

and add it to V, i.e., let V <- V U {v} 
7: else 

8: select a random vertex v 6 V of maximum degree 

(> 1) and add it to V, i.e., let V <- V U {v}. 
9-. end if 

10: Remove v and all its adjacent edges from G. 
ii: end while 

12: The vertex set V is an independence set in the original 
undirected graph G, and it follows that V is a clique in 
the original undirected graph G. 



the size of the graph, we have, in some cases, restricted the 
code search to constant-weight codes, i.e., to codes in which all 
codewords have the same Hamming weight (w m i n = w max = 
w, for some w), or to nearly constant-weight codes, where 
w m i n is larger than and/or w max is less than n, and w maK — 
w m in is small compared to n. Also, when the size of the graph 
becomes too large, the greedy algorithm from Algorithm 1 is 
used instead of a much more complex exhaustive algorithm. 

In Table III, the size M of both restricted and unrestricted 
ternary codes for different values of the block length n and the 
minimum distance dB,min are presented. The numbers in the 
parentheses are from an unrestricted search for ternary codes, 
and should be compared to the numbers in front that are from 
a restricted search, as described in Section X-B. The numbers 
in bold are exact values (from an exhaustive search) for non- 
constant-weight codes, i.e., with w m i n = and w max = n. 
As can be seen from the table, in some cases, the binary code 
construction gives optimal ternary codes. 

Notice that the output of this ternary code search is not 
purely numerical and actually yields codes as sets of code- 
words. While this may not always be a valuable information 
(especially for codes with no known form of regularity), in 
our case it provides binary codes C2 that yield good or even 
optimal ternary codes. It also provides the families C|, but 
only the size of these matter, so the problem of finding the 
best C2 is the same as the one of finding the largest binary 
codes of length d, for a given minimum distance. 

XI. Conclusion 

In this paper, coding for a non-symmetric ternary channel 
where some transitions are not allowed was addressed. We 
derived the ML decoding rule for this channel and showed 
that it is complex to compute, since it depends on the error 
transition probability p. We then proposed an alternative 
decoding rule, called c?A-decoding, based on a more suitable 
distance measure which does only depend on code properties. 
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TABLE III 

Constructive lower bound on the size M of ternary codes for different values of the block length n and minimum c( b -distance 

^B.min WHEN USING A CONSTRUCTION BASED ON BINARY CODES COMBINED WITH A BINARY CODE SEARCH. THE NUMBERS IN THE PARENTHESES ARE 
FROM AN UNRESTRICTED SEARCH FOR TERNARY CODES, AND THE NUMBERS IN BOLD ARE EXACT VALUES (EXHAUSTIVE SEARCH). 



n 




5 




6 




7 




8 




9 




10 


11 


2 


122 


(122) 


365 


(365) 


1094 


(1094) 


3281 


(3281) 


9842 


(9842) 


29525 


88574 


3 


21 


(27) 


54 


(61) 


149 


(168) 


337 


(383) 


937 


(990) 


2306 


6581 


4 


17 


(17) 


38 


(40) 


92 


(94) 


241 


(272) 


545 


(607) 


1482 


3476 


5 


5 


(7) 


9 


(14) 


17 


(26) 


25 


(53) 


50 


(117) 


106 


277 


6 






9 


(12) 


17 


(18) 


21 


(35) 


46 


(77) 


82 


188 


7 










9 


(9) 


17 


(17) 


21 


(25) 


41 


77 


8 














17 


(17) 


21 


(21) 


41 


73 


9 


















7 


(11) 


13 


25 


10 






















13 


25 


11 
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We showed that under certain conditions g?a -decoding and 
ML decoding rules are equivalent. Further, we analyzed error 
correcting capabilities of ternary codes over this particular 
channel under c£a -decoding. We derived an upper bound and 
a constructive lower bound on the code size, showing the 
existence of good codes. Following the proposed constructive 
method, we found good codes for several values of n and 
cfe.min- The proposed construction method was also extended 
to q-aiy generalizations of the non-symmetric ternary channel. 
Finally, the constructive lower bound was compared with 
results from a clique-based search for optimal ternary codes 
for small code lengths. It is shown that in some cases the 
proposed construction method gives optimal codes. 
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Appendix A 
Proof of Theorem 1 

We prove the direct implication and its converse. 

• Direct implication: 

We assume that (13) does not hold. For n odd, we write 
n = 2m + 1. Consider the code C = {0„, 1„} consisting 
of two codewords, the all-zero codeword and the all- 
one codeword, and the received vector y = m+ \l m 
consisting of m + 1 zeros and m ones. Clearly, cIa- 
decoding decodes y to the all-zero codeword 0„. On the 
other hand, 

Kyio„) = (i- ? ,r+ i ( ? ,/2r , 

p{y\l n ) = {p/2) m+1 (l-p/2) m . 

Using the hypothesis, we obtain p(y\Q n ) < p(y|l n ). 
Therefore, ML decoding will not necessarily decode to 
0„. 

If n is even, we use the same argument considering the 
same vectors with an extra zero appended at the end. 

• Converse: 

We consider a word y e of weight w y . For a 
given d and a codeword x such that d\ (x, y) = d, the 
conditional probability that y was received knowing that 
x was transmitted is 

n 

p(y\ x ) = ]Jp(yi\xi) 

»=i (46) 

-(!)V»>-(i-£)"— 

where a = \{i :xi=yi = 0}|. 

Because 1— p < 1 — |, if x varies with n, w y , and d fixed, 
the probability (46) decreases for increasing values of ao- 
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Notice that ccq satisfies ao < min{n — d,n — w y }, where 
n — d is the number of symbols that are equal in x and y, 
and n - w y is the number of symbols in y. Therefore, 
p(y\x), for given n, w y , and d, can be lower-bounded by 

p(y|x) > B-(n,w y ,d) 
'( 

(f) d (l-p)" 



(|) d ( 1 -|) U ' y d (l-p) n - w * if d<w y , 

otherwise. 
(47) 



Similarly, since ao > max{0, n — w y — d}, for given n, 
w y , and d, p(y|x) can be upper-bounded by 

p(y|x) < B + (n,w y ,d) 

= { (it I 1 - f )"* C 1 - p)"" 1 "^ if d < n - w y , 
I (l)^ — 2)" d otherwise. 

(48) 

We now prove that if (13) holds, then B + (n, w yi d+l) < 
B~(n,w y ,d) for all n, w y , and d. Thus, we prove that 
the closest codeword to a received vector y is always the 
most likely one. For the sake of simplicity, we will denote 
these two bounds by = B + (n,w y ,d + 1) and 

BJ = B~(n,w y ,d), respectively. There are 4 subcases 
depending on the value of d compared to w y and to 



- For 



d < 



b: 



d + 1 < n — w y 

= (l) d 0--§) Wy ~ d O--p) n - v '* 

\d+l. 



Thus, -B^j < Bj if and only if 



p/2 < f 1-p 



(49) 



1-p ~ \l-p/2 / 

To prove (49), it is sufficient to show that d < 
because by hypothesis 

i-p \ L^J 



g/2 

1-p " V 1 -P/ 2 



< 



and because 



1-P 
l-p/2 

In this subcase of the proof 



< 1. 



d < n — w y — 1, 

d < Wy. 

Therefore, 2d < n - 1, and d < [^J • 



(50) 



(51) 



(52) 



— For { j 1 1 ^ 

a + 1 > n — Wy 



B 



= (f) a I 1 -!)"' 5 ' ^ (l-p) n_,0y 

^1= (5)^(1 -fr"" 1 - 



Thus, Bj +1 < B d if and only if 

p/2 



\ n — w y — 1 

I ^— 4 ) • (53) 



1-p \l-p/2 



The proof of (53) is similar to the proof of (49): it is 
sufficient to show that n — w y — 1 < ■ Since 

in this subcase 



n — Wy < d + 1, 
n — w y < n — d — 1, 



(54) 



we have 2(n — w y ) < n, and n — w y — 1 < § — 1 < 

- For ( I ^ : 

[ d + 1 < n — w y 

f =(f) d (l-p)- d , 

I s+ +1 =(D d+1 (i-f)^(i-pr-^-i. 

Thus, < if and only if 



p/2 



< 



1-p 



1-p \l-p/2 



(55) 



Again, to prove (55), we show that w y < [^Y^J- 
Now, 

d-2, 
d, 



Wy < n 



W.'v 



< 



(56) 



so we have 2w y < n — 2, and w y < L 1 ^] • 

l; f '/ 
- For 



d+1 > n 



B 



w v 



\n—d 



1 ^(^(i-sr" -1 

Thus, < if and only if 



p/2 



< 



p 



n — d— 1 



(57) 



1-p ~ VI 

Again, to prove (57), we show that n — d — 1 < 



\j L 2^\ ■ As in this subcase, 



n — d < w y + 1, 
n — d < n — w v , 



(58) 



we have 2(n - d) < n + 1, and n - d - 1 < [^J . 

Appendix B 
Proof of Proposition 1 

Let x be a codeword of C3 transmitted over the ternary 
channel K, and let y be the received vector at the output of 
the channel. If a decoder implementing the <iA-decoding rule 
erroneously decodes y to x / x, then 



^A(x,y) > d A (x,y). 
Using (59) and Definition 3, 

2d A (x,y) > d A (x,y) +d A (y,x) > d B (x,x) 

> dB.min > 2 



dB,i 



1 



Therefore, we successfully d A -decode y if 

dB,min 1 

d A (x,y) < 



(59) 



(60) 



(61) 
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where d A (x, y) is the number of errors that occurred during 
the transmission of x. 

Conversely, by Definitions 3 and 4, there exist two code- 
words x and x and a vector y eZj such that 



(62) 



<fe(x,x) = d B ,min 

d B (x, x) = d A (x, y) + d A (y , x) . 
Therefore, if d A (x, y) > t A , then 

dA ( Xiy) >^l^ rf A(x,y)+rf A (y,x)_ (63) 

Thus, c?A(y,x) < c?a(x, y), and the dA-decoder may fail to 
decode y to x. 



Appendix C 
Proof of Proposition 2 

We first prove that the smallest spheres are the ones centered 
on words of maximum weight (the vertices of the hypercube). 

Let n and r be two integers. For w < n, let be a vector 
of Zg of weight w. The volume of S(u W7 r) is independent 
from the choice of u w . We denote it by V(n, w, r). For n > 0, 
we denote by the vector of Zg -1 obtained by removing 
the last symbol of u w : 

V(n,w,r) = |{veZ 3 l : d B (u w ,v) < r}\ 

= |{w0 : w G Z™- 1 Ad B (u» < r}\ 
+ |{wl : weZJ-'AdeK.w) < r- — 1}| 
+ |{w2 : weZ™" 1 Ad B (u' w ,w) <r-l}| 
= V(n- l,w,r) +2V(n- l,w,r- 1), 

(64) 



where for w e 



wO denotes the vector of Z 3 l obtained 



by appending a at the end of w. 
Similarly, we show that for w < n 



1, 



V(n,w + l,r) = V(n — 1, w, r) + V(n — 1, w, r — 1) 
+V(n- l,w,r-2). 

(65) 

Therefore, if n > and u) < n — 1, 

V(n, ui, r) — V(n, w + 1, r) 

= V(n - l,io,r- 1) - V(n - 1, to, r - 2) > 0. 

(66) 

From (66), it follows that the spheres of minimal volume are 
the ones centered on words of maximum weight. 

Now, we give an expression for V(n, n, r). We consider the 
all-one vector l n . Let v e Z3 : v is in <S(l n , r) if and only if 
G?B(ln,v) < r. We denote by d this distance, and by e2 the 
number of positions i where Vi = 2. The number of positions 
j where vj = is d — 2e2- The number of vectors v that 
match a given d and e 2 is therefore: 



n - e 2 
d-2e 2 



We conclude by summing over all possible d and e 2 : 

r |d/2j 

|5(l»,r)| = EE 



d=0 e 2 =0 



n - e 2 
d- 2e 2 



(67) 



Appendix D 
Proof of Theorem 2 

Let x and x be two codewords of C3. Since d B (x, x) > 
2tA + 1, the spheres <S(x, t A ) and 5(x, £a) are non- 
intersecting. This implies that 



U ^( x ^a) 
xec 3 



= e i^a)! 

xec 3 

t A L^/2J 

> NEE 

d=0 e 2 =0 



Furthermore, 



U <S( x ^a) 

xSC 3 

Therefore, we conclude that 

|C 3 | < 



< 1^3 I 



3™ 



n \ / n — e 2 
£2) \d- 2e 2 



3". 



(68) 



(69) 



tA Ld/2J 

EE 

d=0 e 2 =0 



n \ / n - e 2 
e 2 / _ 2e 2 



Appendix E 
Proof of Proposition 7 

Let x and z be two distinct codewords of C3 . We denote by 
x the codeword of C 2 and by x' the codeword of C^* such 
that x = ip^(tp(x')) (the unicity is proved in the proof of 
Proposition 6). Likewise, we define z and z' with respect to 
z. We consider two cases: 

• Case x = z: In this case, x' and z' are two different 
codewords of (otherwise x = z). Thus, by choice 
of the code C™* it follows that d H (x',z') > [rf B ,min/2]. 
By Propositions 3 and 4, 

d B (x,z) =d B (^x(V(x')),^(V(z'))) =d B (V(x'),^')) 

= 2d B (x',z') > 2rd B . min /2] > d B , min . 

(70) 

• Case x ^ z: By choice of C 2 it follows that d B (x, z) > 
dB.min- Now, by Proposition 5, 



d B (x,z) = d B (^x(-0(x')),</?z(-0(z'))) > d B (x,z) 

— ^B.min- 



(71) 



In both cases, d B (x, z) > d B , m i n , which concludes the proof. 

Appendix F 
Proof of Proposition 8 

We prove that, if d(x, y) < t A , then y is properly decoded, 
i.e., Ui = Ui and u 2 = u 2 . 



. ui = m: 



d(xi,yi) = ^2d(xu,yu) 

i=l 

= \{i ■ x u ^ yu}\ 
= \{i : x t ^ yi}\ 
= cfe(x,y) < t A 



(72) 
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Therefore, since xi is a codeword of C 2 which has an error 
correction capability t\, y is successfully decoded to Ui = m. 
• u 2 = u 2 : 

Since ui = ui, we have xi = xi and = % r 

We prove that y 2 can be decoded to u 2 using the decoder 

of code C 2 X1 ■ Consider a position j, 1 < j < w Rl : 

- Either yij = %2j, 

- Or y 2 j ^ X2j and y 2 j = ?, which involves that xi * 
y has a at coordinate i = g^ x (j) (the jth non- 
zero entry of xi). Since xu = 1, yi = 0. Also, 
since Ui = m, then xi = xi, which implies that 

Q&iti) = 9*iU) = *■ Thus > ^ ^ and x, ^ y it 



with yi = since yij = ?■ By assumption there are 
at most tA coordinates such that Xi ^ yi. Therefore, 
we conclude that \{j : j/ 2j ; =?}| < tA, 
- Notice that the case y 2 j 7^ %2j and y 2j - ^ ? is not 
possible. Following the reasoning of the previous 
case, it would involve that x and y are different but 
both non-zero at a given coordinate, which is not a 
possible transition in our channel Ti. 

Therefore, no error and at most tA = rfB - m " 1 ~ 1 erasures 



differentiate y 2 from x 2 . Since the minimum distance of 



is such that 



> tA, its decoder can correct 



these at most tA erasures, and successfully estimate u 2 . 



